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Abstract 

We show how the evolving set methodology of Morris and Peres can be used to show Cheeger 
inequalities for bounding the spectral gap of a finite Markov kernel. This leads to sharp versions of 
several previous Cheeger inequalities, including ones involving edge-expansion, vertex-expansion, 
and mixtures of both. A bound on the smallest eigenvalue also follows. 
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1 Introduction 

Given a finite, irreducible reversible Markov kernel P the Perron- Frobenius theorem guarantees that the 
matrix P has a real valued eigenbasis with eigenvalues 1 = Ao(P) > Ai(P) > • • • > A„_i(P) > —1. The 
spectral gap A = 1 — Ai(P) between the largest and second largest eigenvalues, or in the non-reversible 
case the gap A = 1 — Ai ( — ^p- ) of the additive symmetrization, governs key properties of the Markov 
chain. Alon p], Lawler and Sokal [5], and Jerrum and Sinclair [4 J showed lower bounds on the spectral 
gap in terms of geometric quantities on the underlying state space V, known as Cheeger inequalities. 
Similarly, in the reversible case Diaconis and Stroock [3] showed a lower bound on 1 + A n _i, known 
as Poincare inequalities, which also have a geometric flavor. 

Inequalities of both types have played an important role in the study of the mixing times of 
Markov chains. Conversely, the authors of [7] used their Evolving set bounds on mixing times to 
show a Cheeger inequality, although this was later removed as it was weaker than previously known 
bounds. We improve on their idea and find that our resulting Theorem 13.21 can be used to show sharp 
Cheeger-like lower bounds on A and 1 + A n _i, both in the edge-expansion sense of Jerrum and Sinclair, 
the vertex-expansion notion of Alon, and a mixture of both. The bounds on A typically improve on 
previous bounds by a factor of two, which is essentially all that can be hoped for as most of our bounds 
are sharp; the notion of edge-expansion used in our Cheeger inequality for 1 + A n _i is entirely new. 

The paper is organized as follows. In the preliminaries we review some mixing time and Evolving 
set results. This is followed in Section[3]by our main result, an Evolving set generalization of Cheeger's 
inequality. In Section [4] this is used to show a sharp version of the edge expansion Cheeger Inequality 
and to improve on vertex-expansion bounds of Alon and of Stoyanov. Similar bounds on A n _i, and 
more generally the second largest magnitude eigenvalue, are found in Section [5j 
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2 Preliminaries 



Consider a finite ergodic Markov kernel P (i.e. transition probability matrix) on state space V with 
stationary distribution it. This is called lazy if P(x,x) > 1/2 for every x £ V, and is reversible 
if P* = P where the time-reversal P*(x,y) = ~^y^^ • The ergodic flow from A C V to B C V 
is Q(A, B) = YlxeAyeB 7: ( x )^'( x ->y)- The total variation distance between distributions o and ir is 
[|<7 — vr||Ty = ~ X^ey l "^) ~~ 7r ( :E )l- The rate of convergence of a reversible walk is related to spectral 
gap 01 E| by 

11 1 A n 

- (1 - A)" < - \ n max < max ||pS - vr|| ry < - . max (2.1) 
2 2 iceF 2 mm^gy TT{y) 

where p"(y) = P n (x,y) and X max = max{Ai(P), |A„_i(P)|}. 

Morris and Peres [7] introduced a new tool for studying the rate of convergence: 

Definition 2.1. Given set A C V a step of the evolving set process is given by choosing u £ [0, 1] 
uniformly at random, and transitioning to the set 

A u = {y e V : Q(A, y) > U7r(y)} = {y G V : P*(i/,A) > u} . 

The walk is denoted by Sq, ^x, S2, ■ ■ •, 5^, with transition kernel K n (A, S) = Prob(S n = S\Sq = A), 
and expectation E n f(S n ) = J2s n cV Kn ( s o, S n ) f(S n ). 

The main result of [7] is a bound on the rate of convergence in terms of Evolving sets: 

Lemma 2.2. If So = {x} for some x G V then 
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Hp" - k\\tv < n 1 N E nV / min{7r(,S' n ),l-7r(5' n )}. 
2 ir(x) 

A few easy lemmas of theirs will be required for our work, both of which the interested reader 
should have little trouble in showing. First, a Martingale relation: 

Lemma 2.3. If A C V then 

-\ 

7r(AJ du = ir(A) . 







Note from the definition that for a lazy walk A u C A if u > 1/2, while A u D A if u < 1/2. The 
gaps between A and A u are actually related to ergodic flow: 

Lemma 2.4. Given a lazy Markov chain, if A C V then 

r i/2 ,1 
Q(A,A C )= (7T(A u )-7r(A))du= (ir(A)-ir(A u ))du. 

JO Jl/2 



3 A Generalized Cheeger Inequality 



Recall that a Cheeger inequality is used to bound eigenvalues of the transition kernel in terms of 
some geometric quantity. "The Cheeger Inequality" in the finite Markov setting generally refers to 
the bound 

j,2 Q(A A c ) 

A > 1 - VI - h 2 > — where h = min v ; . (3.2) 
~ ~ 2 Acv, it (A) 

n(A)<l/2 
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The quantity h is known as the Cheeger constant, or Conductance, and measures how quickly the 
walk expands from a set. We now show a generalization of the Cheeger inequality which is expressed 
in terms of Evolving sets. The Cheeger constant is replaced by /-congestion: 



Definition 3.1. If / : [0, 1] — > M+ and A C V the /-congestion of A is given by 

CM) 



tif(n(A u ))du 



f{<A)) ■ 

The /-congestion is given by Cf = max^ c y Cf(A). 

Small /-congestion corresponds to a rapid change in set size of the Evolving set process. In [6] 
it is found that to study many measures of convergence rate (total variation, relative entropy, chi- 
square, etc.) there correspond appropriate choice of Cf. The /-congestion is thus closely related to 
convergence of Markov chains, which in part explains why our main result holds: 

Theorem 3.2. Given a finite, irreducible, reversible Markov chain, and / : [0, 1] — ► R+ then 

A > 1 — Xmax > 1 — Cf . 

If Va G (0, 1/2) : /(a) < /(l — a) then it suffices to let Cf = max^M^x^ Cf(A). 



Proof. Given x G V let So = {x} and M = max^^^o, l 2g7^73]) ' wnere 9 '■ — * ^+ i s some 
function to be defined later. Then by Lemma 12.21 



M 



7T X 



M 

< —rr E n _ 1 g'(7r(S' n _i))C 5 (S' n _ 1 ) 

7r(x) 

Mg(7T(x)) 

7T(X) 9 ' 

The final inequality followed from C g (S n -i) < C g , induction, and 5o = {x}. 
But then, by equation (|2,ip . 




2 max ||p™ — tt\\tv < Cg \ % I max rr I * C g . 



Mg(Tr(x))\ 
ir(x) 



If Va G (0, 1/2) : /(a) < /(l - a) then let g(a) = /(min{a, 1 - a}), noting that since tt((A c ) u ) = 
lim^o 1 - n(Ai- u+ s) then J 5(7r(( J 4 c ) u )) du = Jq g(it(A u )) du. Otherwise let g(a) = f{a). □ 

Remark 3.3. For a non-reversible walk Theorem \3.S\ holds with 1 — X ma x replaced by 1 — \*, where 
A* = maxj>o |Aj| is the second largest magnitude (complex-valued) eigenvalue of P. This follows from 
the related lower bound 

-AJ < max ||p™ -tt\\ tv 
2 xeV 

(see e.g. While intriguing, it is unclear if lower bounds on 1 — A* have any practical application. 

Remark 3.4. An anonymous reader notes that rather than using lower bounds on variation distance 
we could instead use the well known-relation p(A) = lim/^oo for spectral radius in terms of 

a consistent matrix norm satisfying \\Av\\ < C \\ A\\ \\v\\ . In this case take A = P — E where E is the 
matrix with rows all equal to ir, and total variation norm has C = 2. 
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4 Cheeger Inequalities 



Special cases of Theorem 13.21 include bounds of the vertex type as in Alon [TJ, the edge type as in 
Jerrum and Sinclair [4], and mixtures of both. The key to the reduction will be the following lemma: 

Lemma 4.1. Given a concave function f : [0, 1] — ► M and two non-increasing functions g, g : [0, 1] — > 
[0,1] such that g{u)du = g(u)du andyt G [0,1] : f^g(u)du > f^g(u)du, then 

/ f°g(u)du< / fog( u )du. 
Jo Jo 

Proof If x > y, 5 > then A = 1 - a +2g G [§, 1] with x = (1 - A) (y - <5) + A (a? + <5) and 
y = A (y - 6) + (1 - A) (x + 5). Concavity of / implies that f(x) > (1 - A) f(y - 5) + A f(x + 5) and 
/(y) > A/(y - <5) + (1 - A) /(a; + 5). It follows that 

Vx > y, 5 > : f(x) + f(y) > f(x + 5)+ f(y - 5) . (4.3) 

The inequality (|4.3|) shows that if a bigger value (x) is increased by some 5, while a smaller 
value (y) is decreased by <5, then the sum f(x) + /(y) decreases. In our setting, the condition that 
Vi G [0, 1] : J ' <?(ii) du > ^(n) cZtt shows that changing from g to g increased the already large values 
of g(u) when it is small, while the equality g{u) du = g(u) du assures that this is canceled out by 
an equal decrease in the already small values when u is big. The lemma then follows from (|4.3p , □ 

It remains to relate the /-congestion to the edge or vertex notions of Cheeger constant, and then 
choose the optimal function /. 



4.1 Edge expansion 

We first consider edge-expansion, i.e. ergodic flow, and in particular derive a bound in terms of the 
symmetrized Cheeger constant 

h = min h(A) where h(A) - Q ( A > A ^ 



Acv tt(A)tt(A c ) 

To do this a somewhat stronger bound will be shown, and then a few special cases will be considered, 
including that of h. 

Corollary 4.2. Given function f : (0, 1) — * M + such that f and f" are concave, and Va G (0, 1/2) : 
/(o) < /(l — a) then the spectral gap of a finite, irreducible Markov chain satisfies 

A > min 



n(A)<l/2 -f(7T(A))/f"(7T(A)) • 

Without condition f{a) < f{\ — a) the result holds with minimum taken over all proper subsets ofV. 

Proof. First consider the reversible, lazy case. By Lemma 12.41 and the remarks before it, Q(A,A C ) is 
the area below tt{A u ) and above it (A), and also above tt(A u ) and below tt(A). By Lemma 14.11 the 
value Cf(A) is maximized when ir(A u ) = m(u) where m(u) is as in the first diagram of Figure [TJ 
Then 

A>1— Cf > 1 — max — — — — . (4.4) 

} - acv 2/(tt(A)) 1 ; 
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rc(Au) m(u) "(AJ m(u) 




u 



Figure 1: Extreme cases, Vi € [0, 1] : Jq tt(A u ) du > Jq m(u) du for lazy and non-lazy walks. 



In the general case consider the lazy, reversible Markov chain P' = i fj + p ~t, p ). Then apply 
equation (|4,4p to P' and observe that A = 2Ap' and Q(^4, A c ) = 2Qp'(A, A c ), to derive the relation 

A > 2 fl- m ax^ + Q ^^ + /^- Q ^^^ (4.5) 

. . Q(A,^ C ) 2 

> mm 



AcV -f(7T(A))/f"(7T(A)) 



The second inequality required that f(x + 5) + f(x — 5) < 2 f(x) + f" (x) 5 2 . To show this let F(y, 5) = 
f"(y) 5 2 + 2 f(y) - f(y + 5)- f(y - 5). Observe that &F(y, 5) = 2 f"(y)5 - f(y + 5) + f(y - 5) = 
at 5 = 0, and £sF(y,S) = 2f"{y) - (f"(y + 5) + f"(y - 5)) > as /" is concave. Hence F(y,<S) is 
convex in 5 with minimum at 5 = 0, and therefore (5) > F(y, 0) = 0. □ 

Note that (|4.5p holds even if /" is not concave. 

Given h then Q(A, A c ) > }nr(A)7r(A c ) \/A C V. To apply Corollary 14.21 to give a lower bound 
A > ch 2 we look for a solution to — f (x) / f" (x) < c _1 x 2 (l — x) 2 where /, /" concave and c G The 
best solution to this (i.e. maximal c) is c = 1/4 and fix) = \Jx(l — x). A more direct computation 
leads to a slightly stronger result. 

Corollary 4.3. The spectral gap of a finite, irreducible Markov chain satisfies 

h 2 



fc>A>2fl-yi- h 2 /Aj > — . 

Proof. The upper bound is classical. The second lower bound follows from the first because y/l — x < 
l — x/2. For the first lower bound, consider C r^z^ an( ^ a PPty equation (|4.5|) . Then, for some A C V, 



Jl + h{A)ir(A c )\/l - h(A)ir(A) + \/l - h(A)7r(A c )\/ 1 + h(A)ir(A) 

A ^ 2 I 1 



To simplify this let X = \{1 + h(A) tt{A c )) and Y = ±(1 - h(A) ir(A)) in Lemma[53J □ 

The Corollary is sharp on the two-point space u — v with P(u, v) = P(v, u) = 1 and h = 2. Bounds 
in terms of h typically show at best A > 1 and so cannot be sharp for the two-point space. 

Different choices of f(a) work better if more is known about the dependence of h{A) on set size. 
For example, for a walk on a cycle it is better to choose f{a) = sin(-7ra). 
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Example 4.4. Consider the reversible random walk on the cycle C n = Z/nZ of length n with P(i, iztl 
mod n) = 1/2. If A C C n then Q(A,A C ) > 1/n, and so the Cheeger inequality (|3.2p gives the bound 
A > h 2 /2 = 2/n 2 , while Corollary [O] improves this slightly to A > /i 2 /4 = 4/n 2 or even the sharp 
A > 2 when n = 2. To apply Corollary 14,21 directly we solve the differential equation —f/f" < c^ 1 , or 
/" + cf > 0. The largest value of c is obtained by the concave function f(a) = sin(-7ra) with c = it 2 . 
Although /" is not concave the function / can still be used in Equation (|4.5p to obtain 

IT 2 

A>2 min 1 - cos(7r Q(A, A c )) = 2(1 - cos(7r/n)) m . 

7r(A)<l/2 n 2 

A more refined argument can be used to determine A exactly. As before, consider P' = ;p- Then 

A P = 2A P / > 1 - C sin{7ra)i p/ = 1 - cos(27r/n) , 
which is the correct value of A. 



4.2 Vertex-expansion 

The Generalized Cheeger inequality can also be used to show Cheeger-like inequalities in terms of 
vertex-expansion (the number of boundary vertices), leading to sharp versions of bounds due to Alon 
PQ, Bobkov, Houdre and Tetali [2] and Stoyanov [8]. 
Two notions of vertex-expansion are required: 

Definition 4.5. If A C V then the internal and external boundaries are di n (A) = {x G A : Q(x, A c ) > 
0} and d ou t(A) = di n (A c ) = {x £ A c : Q(x,A) > 0}. The internal and external vertex expansion are 



where 



h in = min h in {A) and h out = min h out {A) 

7t(j4)<1/2 n(A)<l/2 



h in (A) = \ " and h out {A) - 



n(A) ' n(A) 

Quantities hi n and hi n (A) are defined similarly, but with tt(A)tt(A c ) in the denominator. The minimum 
transition probability Po = mm Xf L ye y{P(x, y) : P(x,y) > 0} will also be required. 

Theorem 4.6. The spectral gap of a finite, reversible Markov kernel satisfies 

A > 1 - a/1 - h out P - P + hout ~ lj > ^ min {h 2 out , h out ] 

A > l-y/T+ h~P ~ Po (y/l -h in -l)>^ h 2 



in 



A > 1 



\ 2 
hin Po \ 




I I — I - II > P 0(1 + P0) ~ h 2 



in 



For the non-reversible case replace Pq by Pq/2. 



Proof. First consider the reversible lazy case. Given A C V, a vertex x £ A is in di n {A) if and only 
if P(x, A c ) > 0, which happens if and only if Q{A,x) < (1 — Po)tt(x), if and only if x ^ A u for every 
u > 1 — Pq. Thus, given only hi n (A), that tt(A u ) is non-increasing and that tt(A u ) du = tt(A), then 
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Figure 2: Extreme case for P with di n (A), and P* with d ou t(A). 



the integral J Q * tt(A u ) du is minimized for all t G [0,1] if vr(^4 u ) = m(it) is as in the first diagram of 
Figure El 

Finish by applying Lemma [4.1l to upper bound in terms of h{ n or C ^y a ^_ a ^ m terms of hi n . For 
h-out upper bound using the second diagram of Figure [2j As before, the non-lazy case is reduced 
to the lazy case by the relation Ap = 2Ap/ where P' = while the non-reversible case is reduced to 
the reversible case by the relation Ap = Ap» where P" = p+p . □ 

To compare this to previous bounds we note that Stoyanov [8], improving on results of Alon p] 
and Bobkov, Houdre and Tetali [2], showed that a reversible Markov chain will satisfy 



A > max 



|^ (i- ^i-h m y , ^ (yi + /w-i) 2 j 



I Po ,2 Po • r.2 7 i 

> max i — h in , — mm{h out , h out \ 

Our Theorem 14.61 and the approximations y/T— h out Po < 1 — h out Po/2 and y/T+ hj^Po < l + /ij n Po/2, 
give a stronger bound for reversible chains, 

A > — max |l - \J\ - h in , + h out - 1 j 

A > max | y h? n , ^ min{^ f , /w} j . 

Remark 4.7. T/ie and h ou t bounds in this section were not sharp, despite our having promised 
sharp bounds. This is because C /^j^aj — ^\fa, ^ s a better quantity to consider. If C r^jz^j were used 
instead of then we would obtain sharp, although quite complicated, bounds; these bounds simplify 
in the h and h{ n cases which is why we have used C ^/ a ^ 1 _ a ^ f or those two cases. Bounds based on 
C ^ a (j_ a ) are sharp on the two-point space u — v with P(u,v) = P(v,u) = 1. 

4.3 Mixing edge and vertex expansion 

We can easily combine edge and vertex-expansion quantities, and maximize at the set level rather 
than at a global level. For instance, in the reversible case 

f 1 ~ P P 1 

A > min max | ^ h{A) 2 , h in (A) 2 , — mm{h out (A) 2 , h out (A)} j . 

Alternatively, we can apply Lemma 14.11 directly: 
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Theorem 4.8. The spectral gap of a finite, reversible Markov kernel satisfies 



A > min 

n(A)<l/2 



2 - P Q ^1 -h m {A)- P y/1 + h out (A) 



h(A) - P h out (A) 



1 + 



1 



A > min 2 

■k(A)<1/2 



'N 



+ 



HA) 



+ (1 



MA) - PoMi) 

2(1 - P ) 



-For i/ie non-reversible case replace Pq fry Po/2- 



The proofs are no different from that of the cases already dealt with, other than that the worst 
cases m(u) are somewhat more complicated, and so we omit the proofs. As in Remark 14.71 the first 
bound can be made sharp (and even more complicated) by working with 1—C ^ a ^_ a y while the second 
bound is already sharp on the two point space. 



5 Bounding the smallest eigenvalue 

The generalized Cheeger inequality can also be used to bound 1 — X ma x for a reversible walk, by 
examining P directly instead of the lazy walk P' = 4p as before. Techniques of the previous sections 
carry through if modified expansion quantities are used, such as the following: 

Definition 5.1. If A C V then its modified ergodic flow is defined by 

\P(A) = ~ f \n(A u )-Tr(A)\du. 

2 Jo 



The modified Cheeger constant H is given by 

h = min h(A) where h(A) 



*(A) 



AcV 



it(A)it(A c 



By Lemma l2~4l for a lazy chain = Q(A, A c ) and hence also h(A) = h{A). 

We can now show a lower bound on the eigenvalue gap: 

Theorem 5.2. Given a finite, irreducible Markov chain then 

l - A* > l - Vi-h 2 > h 2 /2 . 

Proof. Let pa G [0, 1] be such that ir(A u ) > n(A) if u < pA and n(A u ) > n(A) if u > pa- Then 
= r o PA (vr(A u ) - it (A)) du = J* (tt(A) - tt(A u )) du because tt(A u ) du = tt(A). Apply Lemma 
14.11 with the second figure of Figure |j] to obtain 



C^^(A) < yj (p A + h(A) 7r(^)) [p A - h(A) vr(A) 



+J 1 - pa - ^(A) tt(A c ) 1 - pa + tt(A) 



To finish let X = p A + h(A) ir{A c ) and Y = p A - h(A) it (A) in Lemma [53 



□ 
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For an isoperimetric interpretation of this, note that [6] showed that 

*(A)= min Q(A,B) + 7T{AC) ~^ B ' } Q(A,v). 

tt(B)<it{A c )<it(BUv) 

Hence, to bound spectral gap A consider the worst-case ergodic flow from a set A to its complement 
A c , whereas to bound A* use the worst-case ergodic flow from a set A to a set the same size as its 
complement A c . 

If we choose the /-congestion carefully then even better bounds may be achieved. 

Example 5.3. Consider the cycle walk of Example 14.41 For x = ^ < \ then minjMWj. ^(A) = 
Q{A X , B x ) when A x = {0, 2,4, . . . , 2k - 2} and B x = A X U {-1, -2, -3, . . . , -n + 2k}. Then h = 
Q(^i/2, ^1/2) = if n is even and so 1 + A n _i > 0, while h = — n ^ n * s oc ^ an( ^ so ^ + ^ n — 
fr 2 /2 > 2/n 2 . 

To improve on this, note that a bound similar to the lower bound of Corollary 14.21 holds for ^f(A) 
as well. Since ^(A) > l/2n for all A C V, this again suggests taking f(a) = sin(7ra), and so if n is 
odd then 

1 - ^max > 1 - C sin (7ra) = 1 ~ COs(27T^(^ln-i )) = 1 - COS ( - ) 

K ' 2n \n/ 

This is again an equality. 

Vertex-expansion lower bounds for 1 — A* (and hence also 1 — X ma x) hold as well. For instance, if 
Po = min Xjyg v'{P(x, y) : P(x,y) > 0} (note that x = y is permitted) and 

tt({x G B : Q(A,x) > 0}) 

«o«t = mm mm — 

tt(A)<1/2tt(B)=tt(A c ) ir(A) 

then 1 - A* > y§ mm{hl ut , h out ). 

Example 5.4. A vertex-expander is a lazy walk where hout > e > 0. Analogously, we might define a 
non-lazy vertex-expander to be a walk where h out > e > 0. If the expander is regular of degree d then 

1 _ \ ^ mm{h 2 out , hput] e 2 
Amax ~ 12d ~ I2d ' 

which (up to a small constant factor) generalizes the relation 1 — A max > e 2 /4d for the lazy walk. 
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Appendix 

The following lemma was used for a few simplifications but was left for the Appendix. 
Lemma 5.5. IfX,Y € [0,1] then 

VXY + y/(l-X)(l-Y) < y/l-{X-Yy. 

Proof. 

(\/iy + V(l-I)(l-F)) 2 = 1-{X + Y) + 2XY 

+ ^[1 - (X + Y) + 2XY] 2 - [1 - 2(X + Y) + (X + Y) 2 ] 
< 2[l - (X + Y) + 2 X Y] - [l - 2(X + Y) + (X + Y) 2 ] 
= 1 + 2XY - X 2 -Y 2 = 1 - (X - Y) 2 

The inequality follows from the relation \/a 2 — b < a — b if a 2 > b, a < and a > b (square both 
sides to show this), applied with a = 1 - (1 + 7) + 217 and 6= 1 - 2(X + Y) + (X + Y) 2 . □ 
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